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EcoRI Hpol 

GAATTCCGGT TAACTGAGGA CAAAGGGTAA TGCAGAAGTG ATATTTGATT 7CCATTCTCA 60 

Oral 

TTCCCAGTGG CCTTGATATT TAAACTGATT CCTGCCACCA GGTCCTTGGG CCACCCTGTC 120 

EspBI Sphl 

CCTGCGTCTC ATATTTCTGC ATGCTGCTTT GTTTGTATAT AGTGCGCTCC TGGCCTCAGG 180 



CTCGCTCCCC TCCAGCTCTC GCTTCATTGT TCTCCAAGTC AGAAGCCCCC GCATCCGCCG 240 

BssHII 

CGCAGCAGCG TGAGCCGTAG TCACTGCTGG CCGCTTCGCC TGCGTGOGCG CACGGAAATC 300 



GGGGAGCCAG GAACCCAAGG AGCCGCCGTC CGCCCGCTGT GCCTCTGCTA GACCACTCGC 360 



AGCCCCAGCC TCTCTCAAGC GCACCCACCT CCGCGCACCC CAGCTCAGGC GAAGCTGGAG 420 



TGAGGGTGAA TCACCCTTTC TCTAGGGCCA CCACTCTTTT ATCGCCCTTC CCAAGATTTG 480 

Eco47III Aatll 
AGAAGCGCTG CGGGAGGAAA GACGTCCTCT TGATCTCTGA CAGGGCGGGG TTTACTGCTG 540 

BssHl 1 1 
Pstl 

TCCTGCAGGC GCGCCTCGCC TACTGTGCCC TCCGCTACGA CCCCGGACCA GCCCAGGTCA 600 

BspHI 

OGTCCGTGAG AAGGGATCAT GAAGCACTTG GTAGCAGCCT GGCTTTTGGT TGGACTCAGC 660 

MKHLVAAWLLVGLS 

CTCGGGGTGC CCCAGTTCGG CAAAGGTGAC ATTTGCAACC CGAACCCCTG TGAAAATGGT 720 
LGVP QFG KGD ICNP NPC ENG 
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BspMI 

GGCATCTGTC TGTCAGGACT GGCTGATGAT TCCTTTTCCT GTGAGTGTCC AGAAGGCTTC 780 
GICL SGL A00 SFSC ECP EGF 

. BspMI 

GCAGGTCCGA ACTGCTCTAG TGTTGTGGAG GTTGCATCAG ATGAAGAAAA GCCTACTTCA 840 
AGPN CSS VVE VASO EEK PTS 

GCAGGTCCCT GCATCCCTAA CCCATGCCAT AACGGAGGAA CCTGTGAGAT AAGCGAAGCC 900 
AGPC IPN PCH NGGT CEl SEA 

TATCGAGGAG ACACATTCAT AGGCTATGTT TGTAAATGTC CTCGGGGATT TAATGGGATT 960 
YRGD TFI GYV CKCP RGF NCI 

CACTGTCAGC AGAATATAAA TGAATGTGAA GCTGAGCCTT GCAGAAATGG CGGAATATGT 1020 
HCOH NIN E C E AEPC RNG GIC 



BsmI 

ACCGACCTTG TTGCTAACTA CTCTTGTGAA TGCCCAGGAG AATTTATGGG ACGAAATTGT 1080 
TDLV ANY SCE C PC E FMG RNC 

CAATATAAAT GCTCTGGGCA CTTGGGAATC GAAGGTGGGA TCATATCTAA TCAGCAAATC 1140 
QYKC SGH L G I EGG I ISN Q Q I 

Sacl 
Eel 1361 1 

ACAGCTTCAT CTAATCACCG AGCTCTTTTT GGACTCCAGA AGTGGTATCC CTACTATGCT 1200 
TASS NHR A I F GLQK WYP YYA 

Ncol 
Mscl 

PvuII Boll 

CGACTTAATA AGAAGGGCCT TATAAATGCC TGGACAGCTG CTGAAAATGA CAGATGGCCA 1260 
RLNK KGL INA W T A A END RWP 

TGGATTCAGA TAAATTTGCA AAGAAAAATG AGAGTCACTG GTGTTATTAC CCAAGGAGCA 1320 
W I 0 I N L Q R K M R V T G V I T Q G A 

AAAAGGATTG GAAGCCCAGA GTACATAAAA TCCTACAAAA TTGCCTACAG CAATGACGGG 1380 
KRIG SPE YIK SYKI AYS NOG 
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Bbs I Earl 

AAGACCTGGG CAATGTACAA AGTAAAAGGC ACCAATGAAG AGATGGTCTT TCGTGGAAAT 1440 
KTWA MYK VKG TNEE MVF RGN 



Ndel 

GTTGATAACA ACACACCATA TGCTAATTCT TTCACACCCC CAATCAAAGC TCAGTATGTA 1500 
VONN TPY ANS FTPP IKA QYV 

AGACTCTACC CCCAAATTTG TCGAAGGCAT TGTACTTTAA GAATGGAACT TCTTGGCTGT 1560 
RLYP QIC RRH CTLR MEL LGC 

Sod 
Eel 1361 1 

GAGCTCTCAG GCTGTTCAGA ACCTTTGGGG ATGAAATCAG GGCATATACA AGACTACCAG 1620 
ELSG CSE PLG MKSG HIQ DYQ 



Bbsl 

ATCACTGCCT CCAGCGTCTT CAGAACACTC AACATGGACA TGTTTACTTG GGAACCAAGG 1680 
1TAS SVF RTL NMDM FTW EPR 

AAAGCCAGGC TGGACAAGCA AGGCAAAGTA AATGCCTGGA CTTCCGGCCA TAACGACCAG 1740 
KARL DKQ GKV NAWT SGH NOQ 

TCACAATGGT TACAGGTTGA TCTTCTTGTC CCTACTAAGG TGACAGGCAT CATTACACAA 1800 
SQWL QVD LLV PTKV TGI ITQ 



Pmll 

GGAGCTAAAG ATTTTGGTCA CGTGCAGTTT GTTGGGTCAT ACAAACTAGC TTACAGCAAT I860 
GAKD FGH VQF VGSY KLA YSN 



ApaLI 

GATGGAGAAC ACTGGATGGT GCACCAGGAT GAAAAACAGA GGAAAGACAA GGTTTTTCAA 1920 
DGEH WMV HOD EKQR KOK VFO 

GGCAATTTTG ACAATGACAC TCACAGGAAA AATGTCATCG ACCCTCCCAT CTATGCACGA 1980 
GNFD NDT HRK NVIO PPI YAR 
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TTCATAAGAA TCCTTCCTTG GTCCTGGTAT GGAAGGATCA CTCTGCGGTC AGAGCTGCTG 2040 
FIRI LPW SWY GRIT LRS ELL 



Fspl 

GGCTGCGCAG AGGAGGAATG AAGTGCGGGG CCGCACATCC CACAATGCTT TTCTTTATTT 2100 
G C A E E E 

TCCTATAAGT ATCTCCACGA AATGAACTGT GTGAAGCTGA TGGAAACTGC ATTTGTTTTT 2160 

Hindi 1 1 

TTCAAAGTGT TCAAATTATG GTAGGCTACT GACTGTCTTT TTAGGAGTTC TAAGCTTGCC 2220 
TTTTTAATAA TTTAATTTGG TTTCCTTTGC TCAACTCTCT TATGTAATAT CACACTGTCT 2280 

Eorl 

GTGAGTTACT CTTCTTGTTC TCT 2303 
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9 18 27 36 45 54 

TCT CTT TAG TCA CCA CTC TC6 CCC TCT CCA AGA ATT TGT TTA ACA AAG GCG TGA 



SL*S PLSPSPRICL TKR * 
63 72 81 90 99 108 

GGA AAG AGA ACG TCT TCT TGA ATT CTT TAG TAG GGG CGG AGT CTG CTG CTG CCC 

■ * 



GKRT SS* IL* *GRSLL LP 
117 126 135 144 153 162 

TGC GCT GCC ACC TCG GCT ACA CTG CCC TCC GCG ACG ACC CCT GAC CAG CCG GGG 



C A ATS A T L P S A T T P D Q P G 
171 180 189 198 207 216 

TCA CGT CCG GGA GAC GGG ATC ATG AAG CGC TCG GTA GCC GTC TGG CTC TTG GTC 



SRPGDG IMKRSVAVWLLV 
225 234 243 252 261 270 

GGG CTC AGC CTC GGT GTC CCC CAG TTC GGC AAA GGT GAT ATT TGT GAT CCC AAT 



GL S L GVPQFGKGDICDPN 
279 288 297 306 315 324 

CCA TGT GAA AAT GGA GGT ATC TGT TTG CCA GGA TTG GCT GTA GGT TCC TTT TCC 



PCENG G ICLPGL A VGSFS 
333 342 351 360 369 378 

TGT GAG TGT CCA GAT GGC TTC ACA GAC CCC AAC TGT TCT AGT GTT GTG GAG GTT 



CECP D GFTDPNCSSV VEV 
387 396 405 414 423 432 

GCA TCA GAT GAA GAA GAA CCA ACT TCA GCA GGT CCC TGC ACT CCT AAT CCA TGC 



A SDE E E PTSAGPCTPN P C 
441 450 459 468 477 486 

CAT AAT GGA GGA ACC TGT GAA ATA AGT GAA GCA TAC CGA GGG GAT ACA TTC ATA 



HN GGTC E ISEA YR G OTF I 
495 504 513 522 531 540 

GGC TAT GTT TGT AAA TGT CCC CGA GGA TTT AAT GGG ATT CAC TGT CAG CAC AAC 



G Y V C K C PR GFNG I HCQHN 
549 558 567 576 585 594 

ATA AAT GAA TGC GAA GTT GAG CCT TGC AAA AAT GGT GGA ATA TGT ACA GAT CTT 



I N EC E V E P C KNGG I C TD L 
603 612 621 630 639 648 

GTT GCT AAC TAT TCC TGT GAG TGC CCA GGC GAA TTT ATG GGA AGA AAT TGT CAA 



VANYSCECPGEFMGRNCQ 

FIG. 4A 
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657 666 675 684 693 702 

TAC AAA TGC TCA GGC CCA CTG GGA ATT GAA GGT GGA ATT ATA TCA AAC CAG CAA 



YKCSGPLG IEGGI I SNQQ 
711 720 729 738 747 756 

ATC ACA GCT TCC TCT ACT CAC CGA GCT CTT TTT GGA CTC CAA AAA TGG TAT CCC 



I T A S S T H R A L F G L Q K W Y P 
765 774 783 792 801 810 

TAC TAT GCA CGT CTT AAT AAG AAG GGG CTT ATA AAT GCG TGG ACA GCT GCA GAA 



YYAR LNK KGL I NAW.TAAE 
819 828 837 846 855 864 

AAT GAC AGA TGG AAC CGG TGG ATT CAG ATA AAT TTG CAA AGA AAA ATG AGA GTT 



N DRW N RWIQ INLQ RKMRV 
873 882 891 900 909 918 

ACT GGT GTG ATT ACC CAA GGG GCC AAG AGG ATT GGA AGC CCA GAG TAT ATA AAA 



TG V I TQG AKR I GS P E Y I K 
927 936 945 954 963 972 

TTC TAC AAA ATT GCC TAC AGT AAT GAT GGA AAG ACT TGG GCA ATG TAC AAA GTG 



FYK IAYSNDGK TWAMYKV 
981 990 999 1008 1017 1026 

AAA GGC ACC AAT GAA GAC ATG GTG TTT CGT GGA AAC ATT GAT AAC AAC ACT CCA 



KGTNEDMVFRG N I DNNTP 
1035 1044 1053 1062 1071 1080 

TAT GCT AAC TCT TTC ACA CCC CCC ATA AAA GCT CAG TAT GTA AGA CTC TAT CCC 



YAN S FTPPIKAQ Y V RLYP 
1089 1098 1107 1116 1125 1134 

CAA GTT TGT CGA AGA CAT TGC ACT TTG CGA ATG GAA CTT CTT GGC TGT GAA CTG 



QVCRR HCTLRMELLGC EL 
1143 1152 1161 1170- 1179 1188 

TCG GGT TGT TCT GAG CCT CTG GGT ATG AAA TCA GGA CAT ATA CAA GAC TAT CAG 



SGCSEPLGMKSGHIQDYQ 
1197 1206 1215 1224 1233 1242 

ATC ACT GCC TCC AGC ATC TTC AGA ACG CTC AAC ATG GAC ATG TTC ACT TGG GAA 



I TASS I F RTLNMDM FTW E 
1251 1260 1269 1278 1287 1296 

CCA AGG AAA GCT CGG CTG GAC AAG CAA GGC AAA GTG AAT GCC TGG ACC TCT GGC 



PRKARL DKQ GKVN AW TSG 

FIG. 4B 
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1305 1314 1323 1332 1341 1350 

CAC AAT GAC CAG TCA CAA TGG TTA CAG GTG GAT CTT CTT GTT CCA ACC AAA GTG 



HNDQS QWLQVDLL V P TKV 
1359 1368 1377 1386 1395 1404 

ACT GGC ATC ATT ACA CAA GGA GCT AAA GAT TTT GGT CAT GTA CAG TTT GTT GGC 



TGI I T Q G A K D F G H V Q F V G 
1413 1422 1431 1440 1449 1458 

TCC TAC AAA CTG GCT TAC AGC AAT GAT GGA GAA CAC TGG ACT GTA TAC CAG GAT 



SY KLAYSNDGEH WTV YQD 
1467 1476 1485 1494 1503 1512 

GAA AAG CAA AGA AAA GAT AAG GTT TTC CAG GGA AAT TTT GAC AAT GAC ACT CAC 

E K Q R K D K V F Q G N F D N D T H 
1521 1530 1539 1548 1557 1566 

AGA AAA AAT GTC ATC GAC CCT CCC ATC TAT GCA CGA CAC ATA AGA ATC CTT CCT 



R K N VIDPPI YARHIRILP 
1575 1584 1593 1602 1611 1620 

TGG TCC TGG TAC GGG AGG ATC ACA TTG GCG TCA GAG CTG CTG GGC TGC ACA GAG 

WSWYGRITLASELLGCTE 
1629 1638 1647 1656 1665 1674 

GAG GAA TGA GGG GAG GCT ACA TTT CAC AAC CGT CTT CCC TAT TTG GGT AAA AGT 



EE*GEAT. FHNRLPYL-GKS 
1683 1692 1701 1710 1719 1728 

ATC TCC ATG GAA TGA ACT GTG TAA AAT CTG TAG GAA ACT GAA TGG TTT TTT TTT 



ISME *TV *N L*ETEW FFF 
1737 1746 1755 1764 1773 

rrr tca tga aaa agt ggt caa ah atg gta ggc aac taa cgg tgt ttt tac c 3' 



FS*KSGQ IM VGN*RCFY 



FIG. 4C 
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M K RSVAVtJI LUSJ SI G V P 0 F G KG 0 I 



FIG. 9 



1) CDPNPCENGGICLPGLAVG SFSCECPDGFTDPNCS S VV E VASDE E E PTSAGP 

2) CTPNPCHNGGTCE I SEAYRGDTF I GYVCKCPRGFNG I HCQ HNINE 

3) CEVEPCKNGGICTDLVA- - - - - - - NYSCECPGEFMGRNCQ YK 

CONSENSUS C — PC-NGG-C — Y-C-C--GY-G--C- 

EGF DOMAIN F F 



FIG. 10 

9 18 27 36 45 54 

5' -GT GAT ATT TGT GAT CCC AAT CCA TGT GAA AAT GGA GGT ATC TGT TTG CCA GGA 



X D I C D P N P C E N G G I C L P G 
63 72 81 90 99 108 

TTG GCT GTA GGT TCC TTT TCC TGT GAG TGT CCA GAT GGC TTC ACA GAC CCC AAC 



LAV G SFSCECP DGFTDPN 
117 126 135 144 153 162 

TGT TCT AGT GTT GTG GAG GTT GGT CCC TGC ACT CCT AAT CCA TGC CAT AAT GGA 



CSSVVE VGPCT P NPCHNG 
171 180 189 198 207 216 

GGA ACC TGT GAA ATA AGT GAA GCA TAC CGA GGG GAT ACA TTC ATA GGC TAT GTT 



G T C E I S E A Y R G D T F I G Y V 
225 234 243 252 216 270 

TGT AAA TGT CCC CGA GGA TTT AAT GGG ATT CAC TGT CAG CAC AAC ATA AAT GAA 



CKC P RG FNG I HC QH N I N E 
279 288 297 306 

TGC GAA GTT GAG CCT TGC AAA AAT GGT GGA ATA TGT ACA G 3 * 



CEVEP CK N GGICT 
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EcoRI Sac 1 1 Apal 

GAATTCCGGG AGGGAGGGTA GGGGGGCGGG CCGCGGGGGC CCAAAGCCAG CTAGGCTCAG 60 



TCTCACACGC GCGCCGCCAC TGTTTGTATA TAGTGCGCTC CTGGCCTCAG GCTCGCTCCC 120 



CTCCAGCTCT CGCTTCATTG TTCTCCAAGT CAGAAGCCCC CGCATCCGCC GCGCAGCAGC 180 



GTGAGCCGTA GTCACTGCTG GCCGCTTCGC CTGCGTGCGC GCACGGAAAT CGGGGAGCCA 240 



GGAACCCAAG GAGCCGCCGT CCGCCCGCTG TGCCTCTGCT AGACCACTCG CAGCCCCAGC 300 



CTCTCTCAAG CGCACCCACC TCCGCGCACC CCAGCTCAGG CGAAGCTGGA GTGAGGGTGA 360 

Eco47III 

ATCACCCTTT CTCTAGGGCC ACCACTCTTT TATCGCCCTT CCCAAGATTT GAGAAGCGCT 420 

Aatll PstI 
GCGGGAGGAA AGACGTCCTC TTGATCTCTG ACAGGGCGGG GTTTACTGCT GTCCTGCAGG 480 



CGCGCCTCGC CTACTGTGCC CTCCGCTACG ACCCCGGACC AGCCCAGGTC ACGTCCGTGA 540 



BspHI 



GAAGGGATCA TGAAGCACTT GGTAGCAGCC TGGCTTTTGG TTGGACTCAG CCTCGGGGTG 600 
M KHL V A A WLLV GLS LGV 

CCCCAGTTCG GCAAAGGTGA CATTTGCAAC CCGAACCCCT GTGAAAATGG TGGCATCTGT 660 
POFG KGO ICN PNPC ENG GIC 



BspMI 



CTGTCAGGAC TGGCTGATGA TTCCTTTTCC TGTGAGTGTC CAGAAGGCTT CGCAGGTCCG 720 
LSGL ADO SFS CECP EGF AGP 
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BspMI 

AACTGCTCTA GTGTTGTGGA GGTTGCATCA GATGAAGAAA AGCCTACTTC AGCAGGTCCC 780 
NCSS V V E V A S DEEK PTS AGP 

TGCATCCCTA ACCCATGCCA TAACGGAGGA ACCTGTGAGA TAAGCGAAGC CTATCGAGGA 840 
CIPN PCH NGG TCEI SEA YRG 

GACACATTCA TAGGCTATGT TTGTAAATGT CCTCGGGGAT TTAATGGGAT TCACTGTCAG 900 
DTFI GYV CKC PRGF N G I HCQ 

CACAATATAA ATGAATGTGA AGCTGAGCCT TGCAGAAATG GCGGAATATG TACCGACCTT 960 
HNIN EC E AEP CRNG GIC TDL 



Bsmi 

GTTGCTAACT ACTCTTGTGA ATGCCCAGGA GAATTTATGG GACGAAATTG TCAATATAAA 1020 
VANY SCE CP C EFMG RNC QYK 

TGCTCTGGGC ACTTGGGAAT CGAAGGTGGG ATCATATCTA ATCAGCAAAT CACAGCTTCA 1080 
CSGH LGI EGG I I S N 0 0 I TAS 



Sac I 
Eel 1361 1 

TCTAATCACC GAGCTCTTTT TGGACTCCAG AAGTGGTATC CCTACTATGC TAGACTTAAT 1140 
SNHR ALF GLQ KWYP YYA RLN 

Ncoi 
MscI 

PvuII Boll 

AAGAAGGGCC TTATAAATGC CTGGACAGCT GCTGAAAATG ACAGATGGCC ATGGATTCAG 1 200 
KKGL INA WTA AEND RWP WIQ 



GTAACAGTGG GATGAGACAA ATCCATTTCC CAAATTATCA GAATCATTAT AGAAGTAGGT 1260 
VTVG 

TAGGGAGAAT TGGCTGTGAT TCTTTCTCAT GGTTAAAATG TGATTTAGTT CAGAATTAAC 1320 
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ATGGTTGGAA ACTCTAAAAA ATGTGGAAAA CAGGAACATT CTATGTCTGA AAATCTGAAA 1380 



ATAGCATCAA GATGAAAACA TTCTTTAGTC ATAAATATAC TCTTTTAAGT TATAGTAGAG 1440 
Bglll 

AAAAAGATCT TATCATTTCA TAAGTGGACT TTTGGGATAG CATTGGAAAT GTAAATGAAA 1500 

Sspl 

TAAATACCTA ATTGAAAAAA GTTTATTCTA AAGTGTTAAT ATTTAGCAAC AGATTCAGAG 1560 



ACAAGAAAGT AACAATTCAA TCTGTGTATT TTTTGTGAGA AATAGTTTCC CATGTGCAAA 1620 

Fspl BspHI Pstl 
TAT AAAG TGcTcATCAT ATC ATGATAATAT CCAACTGTCT GCAGAACTCC CTTTCATAAA 1680 



TGAGAGAATT TTAATTCATA GTGCCTTATA TCCTCATCAG CCATCTGACT TTACTACAGA 1740 

Nsil 

AGAAAACAAT GAAATGATGC ATTAAGTGCT TTGCTAGAAG AAACATCATA GCAAAGCTGA 1800 

Xhol 

Hindlll PoeR7I 
TAGCCCACAT TCTGTGCANN NAAGCTTCCA GAGCACTCGA GAAAAAGCAG AAATGAGATG 1860 

Bell 

TTTTATGAAA ACCGAAAAGA TAATCTGATT TCTGTGAAAT ATACTTTTGA TCATGTGGTT 1920 

CTTTAAGATA GTCACTAACA AGTCATTAGT AGCAGATACC AAATGGGAGA AAATTTCCAG 1980 
Bst1107I 

TATACTGAGG GTCAAGGCAG TCATGCTGAA ACTACATGAG GTCAGGAAAG TTTTGAAATA 2040 
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AGGTGATTTT GGAAGGATAC CTTCAACTGG CCTAGATTTT CAAGAAACAG TGTAATCAAC 2100 

AGCCAAACAT GAGMTCTAG CTAACAGCAT TTAGAAAACC AGAACTAAGA GTGTTACTGG 2160 

Oral 

GGAATTGCAT TTAAATCCAG TATGAGAGTT TGCAAATGCC GTATTCTTCT AAGGGGTTTG 2220 

Ncol 

TGCCACATTT TGTTACCATG GAGTCCTCTG TAAGAACTTT ATTAGATAAA TCATCTTTAC 2280 

EcoRI 

ACTATAATTT GAATAAAAGC CGGAATTC 2308 
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